Exemplary Starting Keys


SMARTS Query | Minimum hits | Weight | Comment and corresponding MACCS definition, if any
[R2;r5]~[R2;r5] | 1 | 7 | 1 Any double ring structure with the smallest ring a 5 member ring
[R2;r6]~[R2;r6] | 1 | 7 | 2 Any double ring structure with the smallest ring a 6 member ring
[illegible] | [illegible] | [illegible] | 3 [illegible]
[#4,#12,#20,#38,#56,#88] | 1 | 100 | 4 Group IIA
[r4] | 1 | 100 | 5 4 M ring
[#29,#30,#47,#48,#79,#80] | 1 | 95 | 6 Group IB, IIB
[#8]~[#7](~[#6])~[#6] | 1 | 100 | 7 ON(C)C
[#16]-[#16] | 1 | 100 | 8 S-S
[#8]-[#6](-[#8])-[#8] | 1 | 100 | 9 OC(O)O
[!#6]1-*-*1 | 1 | 100 | 10 QAA@1
C#C | 1 | 85 | 11 CTC
[#5,#13,#31,#49,#81] | 1 | 76 | 12 Group IIIA
[r7] | 1 | 90 | 13 7 M ring
[#14] | 1 | 40 | 14 Si
[#6]=[#6](~[!#6])~[!#6] | 1 | 56 | 15 C=C(Q)Q
[r3] | 1 | 80 | 16 3 M ring
[#7]~[#6](-[#8])-[#8] | 1 | 18 | 17 NC(O)O
[#7]-[#8] | 1 | 9 | 18 N-O
[#7]-[#6](-[#7])-[#7] |  | 45 | 19 NC(N)N
[#6;R]=[#6;R](@*)@* |  | 35 | 20 C$=C($A)($A)
[#53] |  | 40 | 21 I
[!#6][CH2][!#6] |  | [illegible] | 22 QCH2Q
[#15] |  | 20 | 23 P
[#6]-[!#6](-[#6])(-[#6])-* |  | 29 | 24 CQ(C)(C)A
[!#6]~[#9,#17,#35,#53] |  | 23 | 25 QX
[#6]-[#16]~[#7] |  | 50 | 26 CSN
[#7]-[#16] |  | 46 | 27 NS
[CH2]=,:* |  | 26 | 28 CH2=A
[#16;r] |  | 30 | 29 S heterocycle
[#7]-[#6](-[#8])-[#7] |  | 12 | 30 NC(O)N
[#7]~[#6](-[#6])-[#7] | 1 | 20 | 31 NC(C)N
[rows 32 to 35 illegible]
[!#6;H1,H2,H3] ... [!#6;H1,H2,H3] (partly illegible) | [illegible] | [illegible] | 36 QHAQH
[#6] ... [#6] ... [#7] (partly illegible) | [illegible] | [illegible] | 37 C=CN
[#35] | [illegible] | [illegible] | 38 Br
[illegible] | [illegible] | [illegible] | 39 SAN
(Legible values in rows 32 to 39 that cannot be assigned to a row: minimum hits 1, 1; weights 10, 14.)
Exemplary Starting Keys Cont.


SMARTS Query | Minimum hits | Weight | Comment and corresponding MACCS definition, if any
[#8]~[!#6](~[#8])~[#8] | 1 | 10 | 40 OQ(O)O
[ + ++ + ++] (as printed; partly illegible) | 1 | 6 | 41 charge
[#6]=[#6](-[#6])~[#6] | 1 | 11 | 42 C=C(C)C
[#6]~[#16]~[#8] | 1 | 14 | 43 CSO
[#7]~[#7] | 1 | 12 | 44 NN
[!#6;H1,H2,H3]-*-*~*[!#6;H1,H2,H3] | 1 | 10 | 45 QHAAAQH
[!#6;H1,H2,H3]~*--*[!#6;H1,H2,H3] | 1 | 8 | 46 QHAAQH
[#8]~[#16]~[#8] | 1 | 13 | 47 OSO
[#8]-[#7](-[#8])~[#6] | 1 | 11 | 48 ON(O)C
[#8;r] | 1 | 8 | 49 O heterocycle
[!#6]~[#16]-[!#6] | 1 | 12 | 50 QSQ
[#16]!:*:* | 1 | 12 | 51 Snot%A%A
[#16]=,:[#8] | 1 | 13 | 52 S=O
*-[#16](-*)~* | 1 | 12 | 53 AS(A)A
*@*!@*@* | 1 | 11 | 54 A$A!A$A
[#7]=,:[#8] | 1 | 11 | 55 N=O
*@*!@[#16] | 1 | 11 | 56 A$A!S
[#6]:[#7] | 1 | 12 | 57 C%N
[#6][#6]([#6])([#6])* | 1 | 9 | 58 CC(C)(C)A
[!#6]-[#16] |  | 11 | 59 QS
[!#6;H1,H2,H3]~[!#6;H1,H2,H3] |  | 8 | 60 QHQH (&...)
[!#6]-[!#6;H1,H2,H3] | 1 | 8 | 61 QQH
[!#6] ... [#7]-[!#6] (partly illegible) |  | [illegible] | 62 QNQ
[#7]~[#8] |  | 9 | 63 NO
[#8]~*~*~[#8] |  | 7 | 64 OAAO
[#16]=,:* |  | 8 | 65 S=A
[#6H3]~*~[#6H3] |  | 6 | 66 CH3ACH3
*!@[#7]@* |  | 8 | 67 A!N$A
[#6]=[#6](-*)~* |  | 6 | 68 C=C(A)A
[#7]-*-[#7] |  | 5 | 69 NAN
[#6]=[#7] |  | 6 | 70 C=N
[#7]-*-*-[#7] |  | 6 | 71 NAAN
[rows 72 to 75 illegible]
[#7;H2,H3,H4] | 1 | 6 | 76 NH2
[#6]-[#7](-[#6])-[#6] | 1 | 5 | 77 CN(C)C
[illegible] | [illegible] | [illegible] | 78 CH2QCH2
[illegible] | [illegible] | [illegible] | 79 ...A$A (partly illegible)

Figure 3c 



Exemplary Starting Keys Cont. 


SMARTS Query | Minimum hits | Weight | Comment and corresponding MACCS definition, if any
[#16] | 1 | 5 | 80 S
[#8]~*-*~*~[#8] | 1 | 4 | 81 OAAAO
[!#6;H1,H2,H3]~*~*~[#6;H2]~* | 1 | 4 | 82 QHAACH2A
[illegible] | [illegible] | [illegible] | 83 QHAAACH2A
[#8]-[#6](~[#7])-[#6] | 1 | 4 | 84 OC(N)C
[!#6;!#1]-[CH3] | 1 | 4 | 85 QCH3
[!#6;!#1]-[#7] | 1 | 4 | 86 QN
[#7]-*-*-[#8] | 1 | 4 | 87 NAAO
[r5] | 1 | 4 | 88 5M ring
[#7]-*-*-*-[#8] | 1 | 5 | 89 NAAAO
[!#6]1-*-*-*-*-*1 | 1 | 5 | 90 QAAAAA@1
[#6]=[#6] | 1 | 4 | 91 C=C (does not hit aromatic)
*-[#6H2]~[#7] | 1 | 4 | 92 ACH2N
[r8,r9,r10,r11,r12,r13,r14] | 1 | 4 | 93 8M ring or larger
[!#6]-[#8] | 1 | 3 | 94 QO
[#17] | 1 | 4 | 95 CL
[!#6;H1,H2,H3]-*-[CH2]-* | 1 | 4 | 96 QHACH2A
*@*(@*)@* | 1 | 4 | 97 A$A($A)$A
[!#6;!#1]-*(~[!#6;!#1])~[!#6;!#1] |  | 2 | 98 QA(Q)Q
[#9,#17,#35,#53]-*(-*)-* |  | 3 | 99 XA(A)A
[CH3]~*~*-*-[CH2]-[!#1] |  | 4 | 100 CH3AAACH2A
*~[#6;H3,H2]-[#8] |  | 4 | 101 ACH2O
 |  | 3 | 102 NCO
[#7]-*-[#6;H2]-* |  | 4 | 103 NACH2A
*-*(-*)(-*)-* |  | 3 | 104 AA(A)(A)A
[#8]!:*:* |  | 4 | 105 Onot%A%A
[CH3]-[#6;H2]-* |  | 3 | 106 CH3CH2A
[CH3]-*~[#6;H2]~* |  | 3 | 107 CH3ACH2A
[CH3]-*-*-[#6;H2]~* |  | 3 | 108 CH3AACH2A
[#7]-*-[#8] |  | 2 | 109 NAO
*-[CH2][CH2]~* | 3 | 3 | 110 ACH2CH2A > 1
[#7]=,:* | 1 | 3 | 111 N=A
[rows 112 to 114 illegible]
[#8]-[#6]-[#8] | 1 | 3 | 115 OCO
[!#6]-[!#6] | 1 | 2 | 116 QQ
a | 7 | 2 | 117 aromatic ring > 1



Figure 3d 



Exemplary Starting Keys Cont. 


SMARTS Query | Minimum hits | Weight | Comment and corresponding MACCS definition, if any
*@*!@[#8] | 2 | 2 | 119 A$A!O > 1 & ...
*[CH2]*-*-*[CH2]* | 1 | 3 | 120 ACH2AAACH2A
*-[#6&H2]-*-*-[#6&H2]-* | 1 | 3 | 121 ACH2AACH2A
 | 2 | 2 | 122 QQ > 1
[!#6;H1,H2,H3,H4] | 2 | 2 | 123 QH > 1
[#8]~*[CH2]* | 1 | 2 | 124 OACH2A
*@*!@[#7] | 1 | 2 | 125 A$A!N
[#9,#17,#35,#53] | 1 | 2 | 126 X (halogen)
[#7]!:*:* | 1 | 2 | 127 Nnot%A%A
[#8]=,:* | 2 | 2 | 128 O=A > 1
[!#6&r] | 1 | 3 | 129 heterocycle
[!#6]-[CH2]-* | 2 | 2 | 130 QCH2A > 1 & ...
[#8;H1,H2] | 1 | 2 | 131 OH
[#8] | 4 | 2 | 132 O > 3 and other features ...
[CH3] | 3 | 2 | 133 CH3 > 2
[#7] | 2 | 2 | 134 N > 1
*@*!@[#8] | 1 | 2 | 135 A$A!O
*!:*:*!:* | 1 | 2 | 136 Anot%A%Anot%A
[r6] | 7 | 2 | 137 6 M ring > 1
[#8] | 3 | 2 | 138 O > 2
*~[CH2]~[CH2]~* | 1 | 2 | 139 ACH2CH2A
*-[!#6](-*)-* | 1 | 2 | 140 AQ(A)A
[CH3] | [illegible] | 2 | 141 CH3 > 1
*!@*@*!@* | 1 | 2 | 142 A!A$A!A
[#7;H1,H2,H3,H4] | 1 | 2 | 143 NH
[#8]-[#6](-[#6])-[#6] | 1 | 2 | 144 OC(C)C
[!#6][CH2]* | 1 | 2 | 145 QCH2A
[#6]=,:O | 1 |  | 146 C=O
*!@[CH2]!@* | 1 |  | 147 A!CH2!A
[#7]~*(-*)~* | 1 |  | 148 NA(A)A
[#6]-[#8] | 1 |  | 149 C-O
[#6]-,:[#7] | 1 | 1 | 150 C-N
[rows 151 to 154 illegible]
[r6] | 1 | 1 | 155 6 member ring
[#8] | 1 | 1 | 156 O
[illegible] | [illegible] | [illegible] | 157 ring (partly illegible)



Fig. 4a (flowchart). Box labels: START; CREATE AND INITIALIZE ATOM LABELS; INITIALIZE TO FIRST LEARNED KEY; ASSIGN 1-BIT TO VECTOR; SET EACH PARTICIPATING ATOM LABEL; ASSIGN 0-BIT TO VECTOR. Reference numerals: 50, 52, 54, 56, 62, 64, 66.
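The Fig. 4a box labels, read together with the "Minimum hits" column of the key tables, suggest a loop that sets one bit per key. The following is a minimal sketch under stated assumptions, not the method of the source: the `Key` record, the `assign_bits` function, the boolean atom labels, and the precomputed match lists are all hypothetical. A real SMARTS matcher would supply the matches; none is called here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Key:
    number: int    # row number from the comment column
    smarts: str    # SMARTS query (carried along, not evaluated here)
    min_hits: int  # "Minimum hits" column
    weight: int    # "Weight" column


def assign_bits(keys, matches_by_key, n_atoms):
    """Return (bit vector, atom labels) for one molecule.

    matches_by_key maps a key number to a list of matches, each match
    being a tuple of atom indices (assumed output of a SMARTS matcher).
    """
    atom_labels = [False] * n_atoms  # create and initialize atom labels
    bits = []
    for key in keys:
        matches = matches_by_key.get(key.number, [])
        if len(matches) >= key.min_hits:
            bits.append(1)  # assign 1-bit to vector
            for match in matches:
                for atom in match:
                    atom_labels[atom] = True  # set each participating atom label
        else:
            bits.append(0)  # assign 0-bit to vector
    return bits, atom_labels


# Three rows taken from the key tables.
keys = [
    Key(80, "[#16]", 1, 5),
    Key(133, "[CH3]", 3, 2),
    Key(134, "[#7]", 2, 2),
]
# Hypothetical matcher output for a 5-atom molecule: two methyls, two nitrogens.
matches = {133: [(0,), (4,)], 134: [(1,), (2,)]}
bits, labels = assign_bits(keys, matches, n_atoms=5)
print(bits)    # [0, 0, 1]
print(labels)  # [False, True, True, False, False]
```

Key 133 (CH3 > 2) stays at 0 because only two methyl matches are present, while key 134 (N > 1) is set because its two matches meet the minimum of 2.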




Fig. 5 (flowchart). Box labels: TRAINED SOM GRID (ACTIVES); FIT INACTIVES INTO CLUSTERS; INITIALIZE TO FIRST (remainder missing); IDENTIFY AS HOT SPOT; ADVANCE TO NEXT CLUSTER; END. Reference numerals: 92, 94, 100.
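The Fig. 5 labels describe fitting inactives into clusters of a SOM trained on actives and then walking the clusters to identify hot spots. The decision criterion itself is not recoverable from the figure text, so the sketch below assumes one: a cluster counts as a hot spot when its fraction of actives reaches a threshold. The function name, the threshold value, and the example counts are hypothetical.

```python
def find_hot_spots(active_counts, inactive_counts, min_active_fraction=0.8):
    """Return the clusters whose active fraction meets the assumed threshold.

    active_counts / inactive_counts map a cluster id (here a grid
    coordinate) to the number of molecules assigned to that cluster.
    """
    hot_spots = []
    for cluster, n_active in active_counts.items():  # first cluster, then next
        n_inactive = inactive_counts.get(cluster, 0)
        total = n_active + n_inactive
        if total and n_active / total >= min_active_fraction:
            hot_spots.append(cluster)                 # identify as hot spot
    return hot_spots


# Hypothetical counts on a small grid.
actives = {(0, 0): 8, (0, 1): 3, (1, 0): 0}
inactives = {(0, 0): 1, (0, 1): 9}
print(find_hot_spots(actives, inactives))  # [(0, 0)]
```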



Fig. 6 (flowchart). Box labels: START; INITIALIZE TO FIRST HOT SPOT; INITIALIZE TO FIRST MOLECULE IN HOT SPOT; INITIALIZE WEIGHT FOR EACH COMPONENT TO ZERO; INITIALIZE TO FIRST DISCRIMINATING FEATURE; WEIGH EACH COMPONENT ACCORDING TO PARTICIPATION; ADVANCE TO NEXT DISCRIMINATING FEATURE; ADVANCE TO NEXT MOLECULE; ADVANCE TO NEXT HOT SPOT; FIND MAXIMUM COMMON SUBSTRUCTURE; WEIGH NEW KEY BASED ON COMPONENT WEIGHTS; END. Branch labels: YES. Reference numerals: 110, 112, 114, 116, 118, 122, 126, 128, 130, 132, 134.
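The Fig. 6 labels outline a weighting pass: component weights start at zero, each discriminating feature adds weight to the components that participate in it, and a new key is weighed from the component weights of a common substructure. The sketch below makes several assumptions that the figure text does not confirm: components are atoms, participation adds the feature's weight, and the new key's weight is the mean over the substructure's atoms. Finding the maximum common substructure is not implemented; its atom indices are taken as input.

```python
def component_weights(n_atoms, features):
    """Accumulate a weight per atom from the features it participates in.

    features is a list of (feature_weight, atom_indices) pairs; both the
    pairing and the additive rule are assumptions made for illustration.
    """
    weights = [0.0] * n_atoms              # initialize each component to zero
    for feature_weight, atoms in features:  # each discriminating feature
        for atom in atoms:
            weights[atom] += feature_weight  # weigh according to participation
    return weights


def new_key_weight(weights, substructure_atoms):
    """Weigh a new key as the mean component weight over its atoms (assumed rule)."""
    return sum(weights[a] for a in substructure_atoms) / len(substructure_atoms)


# Hypothetical 4-atom molecule with two discriminating features.
weights = component_weights(4, [(5.0, (0, 1)), (2.0, (1, 2))])
print(weights)                          # [5.0, 7.0, 2.0, 0.0]
print(new_key_weight(weights, (0, 1)))  # 6.0
```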



Fig. 7 (flowchart). Box labels: START; INITIALIZE COUNTER; INITIALIZE TO FIRST MOLECULE IN CLUSTER; INCREMENT COUNTER; MORE MOLECULES (decision); ADVANCE TO NEXT MOLECULE; ADD POTENTIAL KEY TO LIST OF NEW KEYS; REJECT POTENTIAL KEY; END. Branch labels: YES, NO. Reference numerals: 140, 142, 144, 146, 148, 149, 152, 154.
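The Fig. 7 labels describe a counter that is incremented while stepping through the molecules of a cluster, followed by a decision to add or reject a potential key. The two decision conditions are not legible, so this sketch assumes that the counter tracks molecules containing the potential key and that the key is accepted when the count reaches a minimum. The function name, the `min_count` parameter, and the example data are hypothetical.

```python
def screen_potential_key(key, contains_key, min_count, new_keys):
    """Count cluster molecules containing the key, then add or reject it.

    contains_key holds one boolean per molecule in the cluster (assumed
    to come from a substructure search that is not shown here).
    """
    counter = 0                      # initialize counter
    for has_key in contains_key:     # first molecule, then each next molecule
        if has_key:
            counter += 1             # increment counter
    if counter >= min_count:
        new_keys.append(key)         # add potential key to list of new keys
        return True
    return False                     # reject potential key


new_keys = []
accepted = screen_potential_key("[#6]-[#16]~[#7]", [True, False, True, True], 3, new_keys)
rejected = screen_potential_key("[#14]", [False, True, False, False], 3, new_keys)
print(accepted, rejected, new_keys)  # True False ['[#6]-[#16]~[#7]']
```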



